ABSTRACT

The robustness of "Student's" t-test with regard to the
assumption of normality is investigated. This is accomplished
by empirically developing the distribution of a new statistic
from a large number of samples of varying sizes from the Beta
and the Gamma distributions using different values of the
parameters. The various significance levels of this new
distribution are then compared with the corresponding
significance levels from "Student's" t distribution using an IBM 360
computer. A comparison of the distribution frequencies with
the standard normal distribution frequencies is also presented.
I.   INTRODUCTION

In practical work, an Operations Research Specialist is
generally involved with the problems of bringing theoretical
structures into some degree of correspondence with the
situations of practical experience. One of the common methods
of achieving this objective is hypothesis testing. The
hypothesis testing method makes use of a variety of statistical
techniques and procedures involving both parametric and
non-parametric tests. In the derivation of most parametric
tests, it is usual to assume a mathematical
model involving some specific probability distribution and
then to select a statistical criterion that is sensitive to
changes in the specific factors tested. This sensitivity
is generally known as the "power of the test".

However, another desirable requirement of any statistical
test is that it be insensitive to changes in the underlying
assumptions. This property is generally referred
to as the "robustness" of the test. Common questions probing
the robustness of a test are: When the actual distribution
is not known, which statistical tests can be used with
less hesitation? What percentage of error would be
incurred if the underlying assumptions are violated? Past
research and subsequent literature indicate that the property
of robustness is generally not satisfied by the parametric
tests. This is substantiated by research conducted by G. E. P.
Box [7] and R. C. Geary [8].

One of the most common parametric tests used for hypothesis
testing is "Student's" t-test. This test was developed by
William Sealy Gosset in 1908 when he was working for the Guinness
Brewery in Dublin, Ireland. The discovery of the t-test was
a direct result of the peculiar problems of brewing with its
variable material and its susceptibility to temperature
changes. A number of experiments conducted at the brewery
emphasized the limitations of large sample theory and revealed
the necessity for a correct method of treating small samples.
It was, therefore, the circumstances of Gosset's work which
led to his discovery of the Theory of Errors and the
distribution of the sample standard deviation, which was developed
later into the well known t-test.


II.   "STUDENT'S"  t  DISTRIBUTION 
As was mentioned earlier, the Theory of Errors finds its
origin in the fact that the accuracy of the mean of a number
of observations may be estimated from the discrepancies observed
among the individual values used in obtaining the mean. When
the samples are drawn from a normal population having a
variance (mean squared deviation) equal to $\sigma^2$, it can be
shown that the mean $\bar{x}$ of N observations is also normally
distributed, with variance

$$\sigma_{\bar{x}}^2 = \frac{\sigma^2}{N}.$$

Thus if the variance of the mean is known, then the desired
information regarding the distribution of the mean can be easily
determined. In order to test a hypothesis that the population has
mean $\mu$, one would merely need to calculate

$$t = \frac{\bar{x} - \mu}{\sigma_{\bar{x}}}$$

and then the integral

$$I = \frac{1}{\sqrt{2\pi}} \int_{t}^{\infty} e^{-u^2/2}\, du$$

would give the probability that a more discrepant value would
occur. If the value of I so calculated
turns out to be a small quantity, such as .01, one should
conclude with some confidence that the hypothesis was not, in
fact, true of the population sampled.
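
As an illustration of the known-variance test described above, the following is a minimal Python sketch. The function names are illustrative and do not come from the original study; the tail integral I is evaluated through the complementary error function, using the identity I = erfc(t / sqrt(2)) / 2.

```python
import math


def standardized_mean(xbar, mu, sigma, n):
    """t = (xbar - mu) / sigma_xbar, where sigma_xbar = sigma / sqrt(n)."""
    return (xbar - mu) / (sigma / math.sqrt(n))


def normal_upper_tail(t):
    """I = (1/sqrt(2*pi)) * integral from t to infinity of exp(-u^2/2) du."""
    return 0.5 * math.erfc(t / math.sqrt(2.0))


if __name__ == "__main__":
    # Hypothetical numbers: mean of 25 observations, known sigma = 2.
    t = standardized_mean(xbar=10.9, mu=10.0, sigma=2.0, n=25)
    print(t, normal_upper_tail(t))
```

A small value of `normal_upper_tail(t)` corresponds to the situation in which the hypothesis would be rejected.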

In a majority of the cases in which such tests are required,
one has no prior knowledge of the variance of the
population. The variance of the population can be estimated,
however, from the sample itself by the following relationships.
If $X_1, X_2, \ldots, X_N$ represents a sample, then

$$s^2 = \frac{1}{N-1} \sum_{i=1}^{N} (X_i - \bar{x})^2$$

is an estimate of the unknown variance $\sigma^2$, and

$$\bar{x} = \frac{1}{N} \sum_{i=1}^{N} X_i$$

is the sample mean and an estimate of the population mean.
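
The two estimators above can be sketched in a few lines of Python. This is only an illustration with hypothetical function names; the standard library's `statistics.mean` and `statistics.variance` compute the same quantities.

```python
def sample_mean(xs):
    """x-bar = (1/N) * sum of X_i."""
    return sum(xs) / len(xs)


def sample_variance(xs):
    """s^2 = (1/(N-1)) * sum of (X_i - x-bar)^2, the estimate of sigma^2."""
    xbar = sample_mean(xs)
    return sum((x - xbar) ** 2 for x in xs) / (len(xs) - 1)


if __name__ == "__main__":
    data = [1.2, 0.7, 2.1, 1.5, 0.9]  # hypothetical sample of size N = 5
    print(sample_mean(data), sample_variance(data))
```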


W. S. Gosset, in his fundamental paper of 1908, showed
that even though $s^2$ as calculated above is a good estimate
of $\sigma^2$, it would be erroneous to assume that the statistic

$$t = \frac{\bar{x} - \mu}{s/\sqrt{N}}$$

would be normally distributed or that the significance
of an observation could be accurately tested by using
normal probability theory. Assuming the independence of $s^2$
and $\bar{x}$, Gosset was able to develop the exact distribution of
this statistic, which in its modern form is known as "Student's"
t distribution. It states that if X and U are two independently
distributed random variables such that X is normally distributed
with mean $\mu$ and variance $\sigma^2$, and U has a chi-square distribution
with N degrees of freedom, then the ratio

$$t = \frac{(X - \mu)/\sigma}{\sqrt{U/N}}$$

has the distribution of "Student's" t, and its density function is
given by

$$f(t) = \frac{\Gamma\left(\frac{N+1}{2}\right)}{\sqrt{N\pi}\;\Gamma\left(\frac{N}{2}\right)} \cdot \frac{1}{\left[1 + \frac{t^2}{N}\right]^{\frac{N+1}{2}}}, \quad \text{where } -\infty < t < \infty.$$

From  this  definition,  it  can  be  seen  that  "Student's" 
distribution  depends  on  N  only  and  is  independent  of  both 
parameters  of  the  normal  distribution  sampled.   It  is  further 
noted  that  the  derivation  of  "Student's"  t  distribution 
depends  on  the  fact  that  the  random  variable  X  be  drawn  from 
a  normal  population. 
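
The density of "Student's" t given above can be evaluated directly. The sketch below is illustrative only (the function names and the integration limits are assumptions made for this example); it also checks numerically that the density integrates to approximately one.

```python
import math


def student_t_pdf(t, n):
    """Density of Student's t with n degrees of freedom."""
    coeff = math.gamma((n + 1) / 2) / (math.sqrt(n * math.pi) * math.gamma(n / 2))
    return coeff * (1.0 + t * t / n) ** (-(n + 1) / 2)


def integrate_pdf(n, lo=-50.0, hi=50.0, steps=100_000):
    """Trapezoidal approximation of the integral of the density over [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.5 * (student_t_pdf(lo, n) + student_t_pdf(hi, n))
    for i in range(1, steps):
        total += student_t_pdf(lo + i * h, n)
    return total * h


if __name__ == "__main__":
    print(student_t_pdf(0.0, 4), integrate_pdf(4))
```

Note that the function depends only on `t` and `n`, which mirrors the observation that the distribution does not involve the parameters of the normal population.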

One of the common uses of "Student's" t-test is to compare
the mean of a population with some standard. In practical
problems, however, it is often the case that the distribution
of the population being sampled is not known or is not normal.
It is therefore desirable that some measure of the degree of
error be obtained for "Student's" t-test when the assumption
of normality of the population is violated.

The literature reveals that a great deal of research has been
directed towards many aspects of "Student's" distribution. In
1925, R. A. Fisher rigorously verified the independence
of the sample mean $\bar{x}$ and the sample variance $s^2$. This was
one of the basic assumptions in the development of "Student's"
distribution [2]. In 1936, R. C. Geary investigated the
distribution of "Student's" t ratio for samples drawn from
a slightly asymmetrical universe by developing asymptotic
formulae for the moments of the t distribution. He showed that
"Student's" distribution gives more accurate results for
symmetrical, non-normal populations than for skewed populations
[4]. In 1948, A. K. Gayen, using the moments technique,
studied the behavior of "Student's" t and obtained results
similar to those obtained by R. C. Geary in 1936 [5]. In more
recent work, O. Cucconi developed a simple relation between the
critical values of "Student's" t and the degrees of freedom.
Through this relation, in addition to deriving the critical
values of t for any number of degrees of freedom, he showed
that it is possible to substitute "Student's" criteria in the
verification of statistical hypotheses more easily than using
a standardized normal deviate [9]. These are but a few
examples of the diversified attention given to "Student's"
t-test. The aim of this paper is to investigate the robustness
of "Student's" t-test when the sampling data are drawn from
the Beta and the Gamma distributions.
III.   METHODS AND PROCEDURES

In order to investigate the robustness of the t-test, it was
decided to select two well known distributions with a wide
scope of application in the real world. The two distributions
selected are the Gamma and Beta distributions. A brief
description of these distributions follows.

A.    GAMMA DISTRIBUTION

A continuous random variable Y is said to have the Gamma
distribution if its density is

$$f(y) = \begin{cases} \dfrac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\; y^{\alpha-1} e^{-y/\beta}, & 0 < y < \infty \\ 0, & \text{otherwise.} \end{cases}$$

It should be noted that this is a two parameter family of
distributions with parameters $\alpha$ and $\beta$. The distribution is only
defined when $\alpha > 0$ and $\beta > 0$. The
mean and variance of this distribution are respectively $\alpha\beta$
and $\alpha\beta^2$. The Gamma distribution is frequently applied to the
solution of queuing and inventory control problems. A
graphical representation of the Gamma distribution for various
values of its parameters is given in Appendix A.

One special and frequently used family of Gamma
distributions is called the Erlang distribution. In order to
obtain this distribution, the values of the parameter $\alpha$ in the
Gamma distribution are restricted to positive integer values,
and this positive integer is denoted by K. If one replaces
$\beta$ by $1/\lambda$, then the distribution becomes

$$f(y) = \begin{cases} \dfrac{\lambda(\lambda y)^{K-1} e^{-\lambda y}}{(K-1)!}, & y \ge 0 \\ 0, & \text{otherwise.} \end{cases}$$

It is possible to show that a variate with the latter density can be
generated as the sum of K independent variates Z having the
exponential density

$$f(z) = \begin{cases} \lambda e^{-\lambda z}, & z > 0 \\ 0, & \text{otherwise.} \end{cases}$$

It should be noted that when K = 1, we have the exponential
density as a special case. The mean, variance and mode of the
Erlang distribution are $K/\lambda$, $K/\lambda^2$ and $(K-1)/\lambda$ respectively. The great
usefulness of this distribution stems from the fact that it is
a large family of distributions permitting only non-negative
values. The waiting time and the service time distributions
in queuing theory can, therefore, be reasonably approximated
by an Erlang distribution.
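
As a quick check of the Erlang density and the moments quoted above, the following Python sketch evaluates the density and integrates it numerically. The function names, the truncation point of the integral, and the step count are assumptions made only for this illustration.

```python
import math


def erlang_pdf(y, k, lam):
    """Erlang density: lam * (lam*y)^(k-1) * exp(-lam*y) / (k-1)! for y >= 0."""
    if y < 0:
        return 0.0
    return lam * (lam * y) ** (k - 1) * math.exp(-lam * y) / math.factorial(k - 1)


def numeric_moments(k, lam, upper=30.0, steps=60_000):
    """Trapezoidal estimates of total probability, mean and variance on [0, upper]."""
    h = upper / steps
    total = first = second = 0.0
    for i in range(steps + 1):
        y = i * h
        weight = 0.5 if i in (0, steps) else 1.0
        p = erlang_pdf(y, k, lam) * weight * h
        total += p
        first += y * p
        second += y * y * p
    return total, first, second - first ** 2


if __name__ == "__main__":
    # K = 4, lambda = 3: the moments should be close to K/lambda and K/lambda^2.
    print(numeric_moments(4, 3.0))
```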

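B.    THE BETA DISTRIBUTION

A continuous random variable Z is said to have a Beta
distribution when

$$f(z) = \begin{cases} \dfrac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\; z^{\alpha-1}(1-z)^{\beta-1}, & 0 < z < 1 \\ 0, & \text{otherwise.} \end{cases}$$
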
This density is a function of the two parameters $\alpha$ and $\beta$, both
of which are positive constants. The mean and variance of
this distribution are given by

$$\frac{\alpha}{\alpha+\beta} \quad \text{and} \quad \frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}.$$

A graphical representation of the Beta distribution for various
values of the parameters is furnished in Appendix B.

It may be pointed out that when $\alpha = \beta = 1$ the distribution
becomes the uniform distribution over the unit interval. The
Beta distribution, though in general defined over the unit
interval, can also be defined over any interval (a, b) as
follows:

$$f(z) = \begin{cases} \dfrac{1}{\xi}\,(z-a)^{\alpha-1}(b-z)^{\beta-1}, & a < z < b \\ 0, & \text{otherwise,} \end{cases}$$

where $\xi = \int_a^b (z-a)^{\alpha-1}(b-z)^{\beta-1}\, dz$.

Primarily because the Beta distribution is defined on a
finite interval, it is widely used in the study of critical
path scheduling techniques such as PERT. It may be recalled
that PERT is a method of reflecting the uncertainties
associated with development-type tasks, and its use generally
results in providing a more realistic outlook for task
accomplishment than the conventional systems provided in the past.
In order to obtain the critical path through the network, it
is necessary that a distribution defined between the
pessimistic and optimistic time estimates adequately represent the
distribution of time required to perform an activity. Since
the Beta distribution restricts the probability distribution
to a finite range for varying values of the parameters, it is
widely used in critical path scheduling.
C.    GENERATION OF DISTRIBUTIONS

The inverse probability integral transformation technique
is used to produce random variates having the exponential
distribution. According to this technique, the cumulative
distribution function for the uniform probability density
function on the interval (0,1) is equated to the cumulative
distribution function for the desired probability density
function in order to obtain the necessary transformation.
Thus if RN denotes a random variable uniformly distributed on the
interval (0,1), and $\lambda$ is the desired parameter of the exponential
distribution, then the random variable obtained from the
transformation

$$X = -\frac{1}{\lambda} \ln(RN)$$

is exponentially distributed with mean $\mu = 1/\lambda$.
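
The transformation above can be sketched in Python as follows. This is a minimal illustration rather than the original IBM 360 program; the function name and the use of Python's `random.Random` as the uniform generator are assumptions.

```python
import math
import random


def exponential_variate(lam, rng):
    """Inverse-transform sample: X = -(1/lam) * ln(RN), RN uniform on (0, 1]."""
    rn = 1.0 - rng.random()  # rng.random() lies in [0, 1), so rn lies in (0, 1]
    return -math.log(rn) / lam


if __name__ == "__main__":
    rng = random.Random(12345)  # fixed seed so the run is repeatable
    draws = [exponential_variate(3.0, rng) for _ in range(100_000)]
    print(sum(draws) / len(draws))  # should be near the mean 1/lambda
```

Using `1.0 - rng.random()` keeps the argument of the logarithm strictly positive, and it has the same uniform distribution as RN itself.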

1.  Erlang Distribution

In order to generate the Erlang distribution, an
extension of the above procedure is utilized by first
generating exponential random variables with mean $1/\lambda$. Then K of
these exponential variates are summed using the relationship

$$X = -\frac{1}{\lambda} \sum_{i=1}^{K} \ln(RN_i)$$

to produce the desired Erlang distribution
with mean $K/\lambda$ and variance $K/\lambda^2$.
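
A minimal Python sketch of this summation follows. The function name and the seeded `random.Random` generator are assumptions for illustration; the sample mean and variance printed at the end should be close to $K/\lambda$ and $K/\lambda^2$.

```python
import math
import random
from statistics import mean, variance


def erlang_variate(k, lam, rng):
    """Sum of k exponential variates: X = -(1/lam) * sum of ln(RN_i)."""
    return -sum(math.log(1.0 - rng.random()) for _ in range(k)) / lam


if __name__ == "__main__":
    rng = random.Random(2024)
    draws = [erlang_variate(4, 3.0, rng) for _ in range(50_000)]
    print(mean(draws), variance(draws))  # compare with 4/3 and 4/9
```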

2.  Beta Distribution

The Beta distribution is generated using the following
theoretical ideas. Consider two random samples $X_1, X_2, \ldots, X_n$
and $Y_1, Y_2, \ldots, Y_m$ such that the random variables X and Y are
identically distributed exponential with parameter $\lambda$ and mean
$\mu = 1/\lambda$. If we form the statistics
$Z_1 = \sum_{i=1}^{n} X_i$ and $Z_2 = \sum_{i=1}^{m} Y_i$, then

$$Z_1 \sim \Gamma(z_1;\, n, \mu)$$
$$Z_2 \sim \Gamma(z_2;\, m, \mu)$$

and their joint density function is given by

$$g(z_1, z_2) = \frac{z_1^{n-1}\, z_2^{m-1}\, e^{-(z_1+z_2)/\mu}}{\Gamma(n)\Gamma(m)\,\mu^{m+n}}, \quad 0 < z_1 < \infty \text{ and } 0 < z_2 < \infty.$$

Now using the transformation $W_1 = \frac{Z_1}{Z_1+Z_2}$ and $W_2 = Z_1 + Z_2$, the
joint density of $W_1$ and $W_2$ is given by:

$$h(w_1, w_2) = \frac{(w_1 w_2)^{n-1}\, w_2^{m-1} (1-w_1)^{m-1}\, e^{-w_2/\mu} \cdot w_2}{\Gamma(n)\Gamma(m)\,\mu^{m+n}}.$$

Integrating over the domain of $W_2$, the marginal density of $W_1$ is

$$h(w_1) = \int_0^{\infty} h(w_1, w_2)\, dw_2,$$

i.e.,

$$h(w_1) = \frac{w_1^{n-1}(1-w_1)^{m-1}}{\Gamma(n)\Gamma(m)\,\mu^{n+m}} \int_0^{\infty} w_2^{n+m-1}\, e^{-w_2/\mu}\, dw_2.$$

Using the substitution $u = w_2/\mu$ and $dw_2 = \mu\, du$ we get:

$$h(w_1) = \frac{w_1^{n-1}(1-w_1)^{m-1}}{\Gamma(n)\Gamma(m)} \int_0^{\infty} u^{n+m-1} e^{-u}\, du = \frac{\Gamma(n+m)}{\Gamma(n)\Gamma(m)}\, w_1^{n-1}(1-w_1)^{m-1}.$$

We observe that $W_1 = \frac{Z_1}{Z_1+Z_2}$ has the Beta distribution
with $\alpha = n$, $\beta = m$, and this Beta distribution is independent of
the parameter of the exponential distribution provided that
the two Gamma distributions are generated by random numbers
having identical exponential distributions. Thus to generate
the Beta distribution, two Gamma distributions with parameters
$(K_1, \lambda)$ and $(K_2, \lambda)$ are generated using the above techniques.
From these distributions, the ratio $W_1 = \frac{Y_1}{Y_1+Y_2}$ is formed to
obtain a Beta distributed random variable with mean $\frac{K_1}{K_1+K_2}$
and variance $\frac{K_1 K_2}{(K_1+K_2)^2 (K_1+K_2+1)}$.
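
The ratio construction derived above can be sketched in Python as follows. The function names are hypothetical and the uniform generator is Python's `random.Random`, not the generator used in the original study. The sample moments are compared with $K_1/(K_1+K_2)$ and $K_1 K_2 / ((K_1+K_2)^2 (K_1+K_2+1))$.

```python
import math
import random
from statistics import mean, variance


def erlang_variate(k, lam, rng):
    """Sum of k exponential variates with parameter lam."""
    return -sum(math.log(1.0 - rng.random()) for _ in range(k)) / lam


def beta_variate(k1, k2, lam, rng):
    """W1 = Y1 / (Y1 + Y2), with Y1 and Y2 independent Erlang variates."""
    y1 = erlang_variate(k1, lam, rng)
    y2 = erlang_variate(k2, lam, rng)
    return y1 / (y1 + y2)


def beta_mean_variance(k1, k2):
    """Theoretical mean and variance of the Beta(k1, k2) distribution."""
    s = k1 + k2
    return k1 / s, k1 * k2 / (s * s * (s + 1))


if __name__ == "__main__":
    rng = random.Random(7)
    draws = [beta_variate(4, 5, 3.0, rng) for _ in range(50_000)]
    print(mean(draws), variance(draws), beta_mean_variance(4, 5))
```

Because the common parameter cancels in the ratio, changing `lam` in this sketch should leave the distribution of the output unchanged.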


D.    SIMULATION TECHNIQUES

The accuracy of generating random observations from a
probability distribution depends upon the characteristics of
the uniform random number generator. At least the superficial
characteristics of the random number generator are ascertained
by the use of the chi-square test for goodness-of-fit at the 95% and
99% confidence levels. For the chi-square test, the unit
interval is divided into fifty equal intervals. Each generated
random number is assigned to one of the fifty categories
according to its size. From this data, a measure of the
discrepancy existing between observed and expected frequencies is
obtained using the relationship

$$\chi^2 = \sum_{i=1}^{T} \frac{(o_i - e_i)^2}{e_i},$$

where $o_i$ denotes the observed frequency of the ith category, $e_i$
denotes the expected or theoretical frequency of this
occurrence, and T denotes the total number of categories. The number of
degrees of freedom for the statistic is equal to T-1.
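
The uniformity check described above can be sketched in Python. The function names are illustrative. The critical value is obtained from the Wilson-Hilferty approximation, which is an assumption of this sketch (the original work would have used tabulated chi-square values), and Python's built-in generator stands in for the generator actually tested.

```python
import random
from statistics import NormalDist


def bin_counts(values, bins=50):
    """Count how many values in [0, 1) fall into each of `bins` equal intervals."""
    counts = [0] * bins
    for v in values:
        counts[min(int(v * bins), bins - 1)] += 1
    return counts


def chi_square_statistic(observed, expected):
    """Sum over categories of (o_i - e_i)^2 / e_i."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def chi_square_critical_approx(df, confidence):
    """Wilson-Hilferty approximation to the chi-square quantile (not exact)."""
    z = NormalDist().inv_cdf(confidence)
    a = 2.0 / (9.0 * df)
    return df * (1.0 - a + z * a ** 0.5) ** 3


if __name__ == "__main__":
    rng = random.Random(99)
    n, bins = 25_000, 50
    observed = bin_counts([rng.random() for _ in range(n)], bins)
    stat = chi_square_statistic(observed, [n / bins] * bins)
    for level in (0.95, 0.99):
        print(level, stat, stat < chi_square_critical_approx(bins - 1, level))
```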

For the purpose of this project, four simulated distributions
were obtained for each type of distribution sampled.
Each simulation utilizes 5000 samples and a fixed sample size.
In the discussion throughout this paper, sample size refers
to the number of data points within each sample used to
calculate the t statistic, and the number of samples denotes the
number of replications for fixed values of the parameters
within each cycle. In order to obtain different shapes of the
distribution, five different sets of values of the parameters
are then used within each simulation.

Since  the  basic  techniques  and  procedures  employed  in 
each  of  the  four  simulations  for  the  Erlang  and  the  Beta 
distributions  are  essentially  the  same,  it  is  considered 
adequate  to  discuss  the  methodology  of  one  simulation  for  each 
distribution  in  detail. 

1.   Erlang Distribution

Simulation 1 for the Erlang distribution consists
of four steps. Step I consists of five cycles. For each
cycle, 5000 samples of size 5 are independently generated
from the Erlang distribution using the values of the parameters
as shown in Table 1 below. For the purpose of discussion,
these sampling distributions are referred to as $\gamma(K,\lambda)$.

TABLE 1

Cycle #                 Cycle 1         Cycle 2         Cycle 3         Cycle 4         Cycle 5

Value of Parameters     K = 4           K = 5           K = 6           K = 7           K = 8
$\gamma(K,\lambda)$     $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3

Mean                    1.333           1.667           2.0             2.333           2.667

Standard Deviation      .6669           .7408           .8200           .8624           .9368

To illustrate the use of Table 1, 5000 samples of
size 5 are generated during Cycle 1 using parameters K = 4,
$\lambda$ = 3. The mean and standard deviation of this Erlang
distribution are given in rows 3 and 4 of Table 1. During Cycle 2,
another 5000 samples of size 5 are independently generated
from the Erlang distribution using parameters K = 5, $\lambda$ = 3. The
mean and standard deviation of this distribution are 1.667
and .7408 respectively. These values are furnished in rows 3
and 4 of Table 1 under the heading Cycle 2. The same process
is repeated for Cycles 3, 4, and 5 using the appropriate
parameters.


In Step II, for each sample the statistic

$$t = \frac{\bar{x} - \mu}{s/\sqrt{N}}$$

is calculated, where $\bar{x}$ is the sample mean, s is the sample standard
deviation, N is the sample size (5), and $\mu$ is the population mean (1.33).
The absolute values of this statistic are then compared with
the corresponding values from "Student's" t-distribution
with N-1 degrees of freedom and at significance levels $\alpha$ = .8,
.5, .2, .1, .05, .02, .01, and .001. These values of
"Student's" t-distribution with varying degrees of freedom
and significance levels are generally referred to as the
critical values $t_\alpha$ of the t-distribution and are the solutions to
the equation

$$1 - \frac{\alpha}{2} = \int_{-\infty}^{t_\alpha} f(t)\, dt$$

for various values of $\alpha$. The
number of t statistics whose absolute values are greater than
the corresponding critical values at each significance level
is recorded for comparison purposes.
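
Step II can be sketched in Python as follows. This is an illustration under stated assumptions, not the original program: the function names are hypothetical, the critical values for 4 degrees of freedom are the usual two-sided values from a standard t-table, and the samples in the usage example are drawn from a normal population so that the observed proportions can be compared with the nominal $\alpha$.

```python
import math
import random
from statistics import mean, stdev

# Two-sided critical values of Student's t for 4 degrees of freedom
# (taken from a standard t-table; keyed by significance level alpha).
T_CRIT_DF4 = {0.8: 0.271, 0.5: 0.741, 0.2: 1.533, 0.1: 2.132,
              0.05: 2.776, 0.02: 3.747, 0.01: 4.604, 0.001: 8.610}


def t_statistic(sample, mu):
    """t = (x-bar - mu) / (s / sqrt(N)), with s the sample standard deviation."""
    return (mean(sample) - mu) / (stdev(sample) / math.sqrt(len(sample)))


def observed_levels(samples, mu, critical):
    """Proportion of samples whose |t| exceeds each critical value."""
    t_abs = [abs(t_statistic(s, mu)) for s in samples]
    return {alpha: sum(t > c for t in t_abs) / len(t_abs)
            for alpha, c in critical.items()}


if __name__ == "__main__":
    rng = random.Random(1)
    # 5000 samples of size 5 from a normal population with mean 0.
    samples = [[rng.gauss(0.0, 1.0) for _ in range(5)] for _ in range(5000)]
    print(observed_levels(samples, 0.0, T_CRIT_DF4))
```

Replacing the normal samples with samples from another population gives the observed significance levels for that population, which is the comparison the study reports.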

In order to fully understand the comparisons between
the actual significance levels and the significance levels
obtained from the Erlang distribution, it is necessary that
some knowledge regarding the comparisons between the
distribution frequencies be known. This is accomplished in Step III
by transforming the total number of data points for the Erlang
distribution in each cycle (5000 x 5 = 25,000) using the
transformation

$$\frac{x_i - \mu}{\sigma}, \quad i = 1, \ldots, 25000,$$

where $\mu$ and $\sigma$ are the sampling
distribution mean and standard deviation respectively. The
transformed distribution is called $\gamma(0,1)$. It may be pointed
out that if the sampling distribution were normal, with mean $\mu$
and standard deviation $\sigma$, then this transformation would
reduce the sampling distribution to what is called the Standard
Normal, with mean zero and variance unity. This is generally denoted
by N(0,1). The transformed distribution $\gamma(0,1)$ is then
divided into four intervals of width unity on either side of the
assumed mean of zero, and the frequency counts within each
interval are recorded. These counts are then compared with
the expected frequency counts within each interval
assuming that the sampling distribution was normal.
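
Step III can be sketched in Python as follows. The function names are hypothetical, and the eight unit-width intervals from -4 to 4 are this sketch's reading of "four intervals of width unity on either side" of zero; values outside that range are simply not counted here.

```python
import math
from statistics import NormalDist

EDGES = list(range(-4, 5))  # interval boundaries -4, -3, ..., 4


def standardize(values, mu, sigma):
    """Apply (x_i - mu) / sigma to every data point."""
    return [(x - mu) / sigma for x in values]


def interval_counts(z_values):
    """Frequency counts in the eight unit-width intervals [-4,-3), ..., [3,4)."""
    counts = [0] * 8
    for z in z_values:
        if -4.0 <= z < 4.0:
            counts[int(math.floor(z)) + 4] += 1
    return counts


def expected_normal_counts(total):
    """Expected counts in the same intervals if the data were N(0,1)."""
    nd = NormalDist()
    return [total * (nd.cdf(hi) - nd.cdf(lo)) for lo, hi in zip(EDGES, EDGES[1:])]


if __name__ == "__main__":
    # Expected N(0,1) frequencies for the 25,000 points of one cycle.
    print([round(e, 1) for e in expected_normal_counts(25_000)])
```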

In Step IV, the chi-square goodness-of-fit test, using
a significance level of 0.05, is conducted on the results of
the observed significance levels and the distribution
frequencies as calculated in Steps II and III above. This concludes
Cycle 1 of Simulation 1. For Cycles 2, 3, 4, and 5, the entire
procedure as outlined above is repeated by changing the
parameters of the $\gamma(K,\lambda)$ distribution in accordance with Table 1.

Simulations 2, 3, and 4 for the Erlang distribution
use exactly the same techniques as outlined under Simulation
1 except that the sample sizes are 10, 15, and 20
respectively.

2.   Beta Distribution

Simulation 1 for the Beta distribution is conducted
in four steps. Step I consists of five cycles which are shown
in Table 2. For each cycle, 5000 samples of size 5 are
independently generated from each of the two Erlang distributions
using fixed values of the parameters as shown in Table 2.
These distributions are referred to as $\gamma_1(K_1,\lambda)$ and $\gamma_2(K_2,\lambda)$.

TABLE 2

Cycle Number               Cycle 1         Cycle 2         Cycle 3         Cycle 4         Cycle 5

Values of Parameters       $K_1$ = 4       $K_1$ = 4       $K_1$ = 4       $K_1$ = 4       $K_1$ = 4
$\gamma_1(K_1,\lambda)$    $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3

Values of Parameters       $K_2$ = 4       $K_2$ = 5       $K_2$ = 6       $K_2$ = 7       $K_2$ = 8
$\gamma_2(K_2,\lambda)$    $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3

Mean (Beta)                .5              .44             .40             .36             .33

STD. DEV. (Beta)           .167            .1589           .150            .140            .1329


To illustrate the use of Table 2, 5000 samples of
size 5 are generated for the $\gamma_1(K_1,\lambda)$ distribution using the
parameters $K_1$ = 4 and $\lambda$ = 3. Another 5000 samples of size 5 are
independently generated for the $\gamma_2(K_2,\lambda)$ distribution using
parameters $K_2$ = 4 and $\lambda$ = 3 as shown in Table 2. During Cycle 2,
5000 samples of size 5 for each of the $\gamma_1(K_1,\lambda)$ and $\gamma_2(K_2,\lambda)$
distributions are independently generated using the parameters
$K_1$ = 4, $\lambda$ = 3 and $K_2$ = 5, $\lambda$ = 3 respectively. These values are
found in Table 2 under the heading Cycle 2. A similar process
is repeated for the remaining Cycles 3, 4, and 5.

For each cycle, the $\gamma_1(K_1,\lambda)$ and $\gamma_2(K_2,\lambda)$ distributions
are combined using the relationship
$\gamma_1(K_1,\lambda) / (\gamma_1(K_1,\lambda) + \gamma_2(K_2,\lambda))$
to form 5000 samples of size 5 for the Beta
distribution having mean

$$\frac{K_1}{K_1+K_2}$$

and variance

$$\frac{K_1 K_2}{(K_1+K_2)^2 (K_1+K_2+1)}.$$

The actual values of these calculations for each cycle are
shown in Table 2. It should be apparent that each cycle
generates 5000 samples of size 5 from a different Beta
distribution.

The remaining Steps II, III, and IV of this
simulation are identical with Steps II, III, and IV of Simulation
1 under the Erlang distribution. Simulations 2, 3, and 4 for
the Beta distribution use exactly the same techniques as
outlined in Simulation 1 except that the sample sizes are 10,
15, and 20 respectively.

In addition, the effects of changes in the variance
of the Beta distribution, when the mean is kept constant, are
also investigated. This is accomplished by choosing
values of the $\gamma_1(K_1,\lambda)$ and $\gamma_2(K_2,\lambda)$ parameters that result in
Beta distributions having the same mean but different variances.
The sets of values of these parameters used during this
simulation are furnished in Table 3 below.

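TABLE 3

Cycle #                    Cycle 1         Cycle 2         Cycle 3         Cycle 4         Cycle 5

Parameters of              $K_1$ = 2       $K_1$ = 4       $K_1$ = 6       $K_1$ = 8       $K_1$ = 10
$\gamma_1(K_1,\lambda)$    $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3

Parameters of              $K_2$ = 2       $K_2$ = 4       $K_2$ = 6       $K_2$ = 8       $K_2$ = 10
$\gamma_2(K_2,\lambda)$    $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3   $\lambda$ = 3

Mean (Beta)                .5              .5              .5              .5              .5

STD. DEV. (Beta)           .036            .022            .0195           .013            .011
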
IV.   RESULTS

The results of the various simulations for the Erlang and
Beta distributions are furnished in tabular form in Appendices
C, D, E, F, G, and H. Appendix C contains Tables 4, 5, 6, and
7. These tables contain the frequency comparisons of the
distributions obtained in these simulations with the Standard
Normal, designated N(0,1), when samples of size 5, 10, 15, and 20
are used respectively. Tables 8, 9, 10, and 11 in Appendix D
contain similar results obtained from the use of Beta
distributions. Appendix E contains Tables 12 and 13. These tables
have the comparison of the frequency distributions from N(0,1)
and the transformed Beta(0,1) when the mean of the distribution
is kept constant and its variance is manipulated.

Appendices F, G, and H contain five tables each. These
tables provide the necessary comparisons between the actual
and observed significance levels for a fixed mean and 4, 9, 14,
and 19 degrees of freedom. Tables 14, 15, 16, 17, and 18 in
Appendix F pertain to the Erlang distribution and contain the
comparisons of the actual and the observed significance levels
when samples of size 5, 10, 15, and 20 are used.
Tables 19, 20, 21, 22, and 23 in Appendix G contain similar
results for the Beta distributions. Tables 24, 25, 26, 27,
and 28 in Appendix H have the comparisons of the significance
levels when the variance of the Beta distribution is
manipulated for a fixed mean.

Since there are but two different table formats used to
illustrate the results, it is considered appropriate to discuss
the use of only one table of each type before proceeding with
the discussion of the results. Tables of the first type,
which give frequency comparisons, are used in Appendices C,
D, and E. The second type of table is used to provide the
comparisons of significance levels, and these tables are
contained in Appendices F, G, and H.

As mentioned earlier, Table 4 pertains to the Erlang
distribution. This table is divided into two parts. Part I
furnishes the means and the standard deviations of the sampling
distributions when samples of size 5 are used. The means
and standard deviations for each distribution are identified
by means of index numbers. For example, Index (1) pertains
to the Erlang distribution with mean 1.33 and standard
deviation .6669. Part II of this table furnishes the frequency
distributions of each Erlang given in Part I and the frequency
distribution of N(0,1). In order to relate the mean and
variance with the corresponding frequency distributions, the
appropriate index should be consulted. For example, if one
wishes to compare the frequency distribution of an Erlang with
mean 2.0 and standard deviation .82, when using samples of
size 5, one should consult Appendix C, Table 4. Corresponding
to the above values, one would select Index 3 from Part I of
Table 4 and obtain the frequency distribution from row (3)
in Part II of this table. In order to compare this frequency
distribution with N(0,1), one would compare the corresponding
values of row 3 with the row marked N(0,1). The asterisks on
the distribution frequencies indicate that the distribution
was found to be significantly different from N(0,1) by the
chi-square goodness-of-fit test at the 95% confidence level.

The second type of table is found in Appendices F, G, and
H. Table 15 in Appendix F pertains to the Erlang distribution
with a fixed mean of 1.667. This table provides the comparison
of the actual significance levels and the observed significance
levels when the sampling population is Erlang with mean 1.667.
This table furnishes these comparisons for four values of
degrees of freedom. These values are 4, 9, 14, and 19; they
are found in the first column of the table. The format of this
table is very similar to the standard t-table found in most
textbooks. In order to make use of this table, one is
required to know the sample size and the population mean. For
example, if one is interested in determining the comparison of
significance levels when sampling from an Erlang distribution
with mean 2.0 using sample size 15, one would consult Appendix
F, Table 16 and obtain the required information in the row
marked 14. The asterisks on the observed significance values
indicate that the results were found to be significantly
different at the 95% confidence level by the use of the chi-square
goodness-of-fit test.

RESULTS  OF  ERLANG 

The results shown in Tables 4, 5, 6, and 7 indicate that
the Erlang frequency distributions are significantly different
from N(0,1) at the 5% level. A comparison of the observed
significance levels with the actual significance levels in
Table 14, Appendix F reveals that when an Erlang distribution
with mean 1.333 is used, the observed alpha values are
significantly different from the actual alpha values. For
example, for sample size 5, corresponding to the actual values
α = .5, .2, and .05, the observed values are α = .034, .004, and
.0004 respectively. It is further noted that as the sample size
is increased to 10, 15, and 20, the corresponding observed
values are zero in each case. This means that all the absolute
values of the t statistic are smaller than the critical values.
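
Since Table 14 is tabulated in terms of 1-α, the observed alpha values quoted above are obtained by subtraction. A minimal sketch of that arithmetic follows; the entries are copied from the row for 4 degrees of freedom in Table 14, and the function name is only illustrative.

```python
# Nominal alpha values and the observed 1 - alpha entries for
# 4 degrees of freedom (sample size 5), copied from Table 14.
NOMINAL_ALPHA = [0.5, 0.2, 0.05]
OBSERVED_ONE_MINUS_ALPHA = [0.9660, 0.9960, 0.9996]


def observed_alpha(one_minus_alpha):
    """Convert a tabulated 1 - alpha entry to an observed alpha."""
    return round(1.0 - one_minus_alpha, 4)


observed = [observed_alpha(v) for v in OBSERVED_ONE_MINUS_ALPHA]
for nominal, obs in zip(NOMINAL_ALPHA, observed):
    print(f"actual alpha = {nominal}, observed alpha = {obs}")
```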

From the results in Table 15, one observes that for
sample size 5 at α = .5, .2, and .05, the observed values are
α = .0292, .002, and .0002 respectively. Each of these values is
smaller than the corresponding value in Table 14. The results
for sample sizes 10, 15, and 20 are the same as those found in
Table 14. The results in Tables 16 and 17 indicate that for
sample size 5, the corresponding observed values are
α = .0292, .0034, .0002 and α = .0268, .0028, .0002 respectively.
The alpha values for sample sizes 10, 15, and 20 remain zero.

In examining the graphs of these distributions, one
observes that as the mean of the distribution is increased, the
distribution, though flatter than the normal distribution,
becomes more nearly symmetric around the mean. However, as the
mean of the distribution is increased, the variance
($K/\lambda^2$) is also increased. Consequently, when sampling
from this distribution, the absolute values of the t statistic,
$(\bar{x} - \mu)/(s/\sqrt{n})$, are smaller than the absolute
values of this statistic when sampling from the normal
population. Furthermore, these values decrease as the sample
size is increased.
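
The relation between the Erlang mean and variance used in this argument can be checked numerically. The sketch below assumes a rate λ = 3 (the value mentioned in Appendix J) and shape values K = 4, ..., 8, which are inferred from the tabulated means 1.333, ..., 2.667 rather than stated in the text; the function name is hypothetical.

```python
import math


def erlang_mean_sd(k, lam):
    """Theoretical mean K/lambda and standard deviation sqrt(K)/lambda."""
    mean = k / lam
    sd = math.sqrt(k) / lam  # square root of the variance K / lambda^2
    return mean, sd


LAMBDA = 3.0  # assumed rate parameter
for k in range(4, 9):  # assumed shape values
    mean, sd = erlang_mean_sd(k, LAMBDA)
    print(f"K = {k}: mean = {mean:.3f}, std. dev. = {sd:.4f}")
```

The theoretical values agree closely with the means in Part I of Tables 4 through 7. The tabulated standard deviations appear to be sample estimates and differ slightly from the theoretical ones.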

From the foregoing results, it is concluded that
"Student's" t-test is sensitive to the Erlang distribution. It
is further observed that when sampling from an Erlang
distribution, the resulting t statistic has significantly
shorter tails than the "Student's" t distribution. It would
seem, therefore, that the resulting t statistic is a form of
leptokurtic curve.
RESULTS  OF  BETA  DISTRIBUTION 

Results in Tables 8, 9, 10, and 11 reveal that, except
when the sample size is 20 and the distribution is symmetric
around the mean (.5), the Beta distribution frequencies are
significantly different from N(0,1). A comparison of the
results in Table 19, Appendix G reveals that when sampling
from the Beta distribution with mean .5, the observed alpha
values in most cases are not significantly different from the
actual alpha values. For example, for sample size 5, the
observed alpha values are significantly different only at
α = .8 and .5. When the sample size is increased to 10, 15, and
20, the results are significant only at α = .8. This suggests
that when sampling from a symmetric Beta distribution, using
samples of size 5 or greater, the "Student's" t-test yields
results that are comparable with results obtained from normal
sampling distributions.
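
The kind of experiment summarized in Table 19 can be sketched as a small Monte Carlo run. The sketch is not the author's Fortran program: it uses Python's built-in generator, assumes a Beta(4, 4) population (chosen because its mean .5 and standard deviation .167 match Part I of Table 8), and uses the standard two-sided 5% critical value of Student's t for 4 degrees of freedom. All function names are hypothetical.

```python
import math
import random
import statistics

T_CRIT_05_DF4 = 2.776  # two-sided 5% critical value of Student's t, 4 d.f.


def t_statistic(sample, mu):
    """One-sample t statistic (x_bar - mu) / (s / sqrt(n))."""
    n = len(sample)
    mean = statistics.fmean(sample)
    s = statistics.stdev(sample)  # sample standard deviation (n - 1 divisor)
    return (mean - mu) / (s / math.sqrt(n))


def observed_alpha(n_samples, n, a, b, t_crit, seed=1):
    """Fraction of Beta(a, b) samples whose |t| exceeds the critical value."""
    rng = random.Random(seed)
    mu = a / (a + b)  # true mean of the Beta(a, b) population
    rejections = 0
    for _ in range(n_samples):
        sample = [rng.betavariate(a, b) for _ in range(n)]
        if abs(t_statistic(sample, mu)) > t_crit:
            rejections += 1
    return rejections / n_samples


# 5000 samples of size 5, as in the simulation described in the text.
alpha_hat = observed_alpha(5000, 5, 4.0, 4.0, T_CRIT_05_DF4)
print(f"observed alpha at actual alpha = .05: {alpha_hat:.4f}")
```

With 5000 samples the estimate carries a standard error of roughly .003, so small departures from .05 should not be over-interpreted.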

The results in Table 20 reveal that when sampling from
the Beta distribution with mean .44, the observed alpha values
for sample size 10 remain unchanged. The observed alpha values
using sample size 5 are significantly different at α = .8,
.5, and .2. The observed alpha values for sample size 20 are
significantly different at α = .8, .5, .1, and .05. Comparing
similar results in Tables 21 and 22, one observes that as the
mean of the sampling distribution is moved to the left of .5,
the use of larger sample sizes (10, 15) yields observed alpha
values that are comparable to the actual alpha values. In
Table 23, one observes that when the sampling distribution has
mean .33, i.e., it is significantly skewed to the right, the
observed alpha values for sample size 5 are significantly
different from the actual alpha values. The results for
sample size 10 are reasonably comparable with the actual alpha
values.

From  the  foregoing,  it  is  concluded  that  the  "Student's" 
t-test  is  sensitive  to  changes  in  the  shape  resulting  from 
changes in the mean of the distribution. When the Beta
distribution is symmetric around the mean, the use of "Student's"
t-test  yields  results  that  are  comparable  with  those  obtained 
when  sampling  from  the  normal  population.   It  is  further  noted 
that  most  of  the  observed  alpha  values  are  smaller  than  the 
actual  alpha  values.   This  suggests  that  the  values  of  the  t 
statistic  are  smaller  than  the  corresponding  values  of 
"Student's"  t  statistic.   It  would  seem  that  the  tails  of  the 
new  t  statistic  are  shorter  than  the  tails  of  "Student's"  t. 

The results of the Beta distributions with fixed mean
and different variances are contained in Tables 24, 25, 26,
27, and 28. A similar investigation of these results indicates
that for small sample size (5) the observed alpha values are
significantly different from the true values. However, as
the sample size is increased to 10, the difference between the
observed and actual alpha values is considerably reduced.
This suggests that "Student's" t-test is sensitive to changes
in the variance when small samples are used.
V.   CONCLUSIONS AND RECOMMENDATIONS

The simulation described on the foregoing pages, which
investigated the robustness of "Student's" t-test, has proven
to be extremely realistic. The results of violating the
assumption of normality by the use of the Beta and the Gamma
distributions indicate that when the sampling distribution is
symmetric around the mean, the use of "Student's" t-test gives
results that are closely comparable with the values obtained
by the use of normal distributions. This implies that
"Student's" t-test is reasonably robust when it is used with
symmetric populations. The results also indicate that in the
case of skewed distributions, the observed alpha values are
significantly different from the actual significance levels.
It is further noted that in the case of symmetric distributions,
the values of observed significance levels approach the true
values as the sample size is increased.

In  the  case  of  symmetrical  distributions  with  a  fixed 
mean and different variances, it is observed that the
"Student's" t-test is sensitive to the change in the variance. The
observed  significance  levels  are  considerably  different  from 
the  true  values  as  the  variance  of  the  sampling  population  is 
increased.   However,  as  the  sample  size  is  increased,  the 
effects  of  change  in  the  variance  are  considerably  reduced. 
APPENDIX C
Table 4

POPULATION:   Erlang
SAMPLE SIZE:  5

Part I    Population Parameters

Mean             1.33    1.667      2.0    2.333    2.667
STD.DEV.        .6669      .74      .82      .86      .93
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparisons of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)             74      524     3399     8499     8499     3399      524       74
γ(0,1) Index
(1)*                0        0     3581    10575     6991     2793      813      247
(2)*                0       10     3653    10250     7244     2820      990      233
(3)*                0       26     3699    10161     7262     2841      774      237
(4)*                0       62     3797     9743     7469     2948      802      179
(5)*                0       75     3824     9762     7390     2490      782      177

APPENDIX C
Table 5

POPULATION:   Erlang
SAMPLE SIZE:  10

Part I    Population Parameters

Mean            1.333    1.667      2.0     2.33    2.667
STD.DEV.        .6604     .749      .83     .905    .9755
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            149     1049     6799    16999    16999     6799     1049      149
γ(0,1) Index
(1)*                0        1     7294    20977    14043     5495     1670      520
(2)*                0       11     7029    20628    14691     5610     1572      459
(3)*                0       39     6854    20464    15090     5636     1500      417
(4)*                0       46     6702    20443    15197     5819     1447      346
(5)*                0       78     6670    20394    15226     5822     1490      320


APPENDIX C
Table 6

POPULATION:   Erlang
SAMPLE SIZE:  15

Part I    Population Parameters

Mean             1.33    1.667      2.0    2.333    2.667
STD.DEV.        .6725    .7677    .8466    .9173    .9983
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparisons of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            224     1574    10199    25499    25499    10199     1574      224
γ(0,1) Index
(1)*                0        0    10018    31845    21764     8223     2423      724
(2)*                0        4     9739    31562    22509     8283     2216      687
(3)*                0       25     9802    31101    22752     8495     2272      553
(4)*                0       54     9750    30661    23129     8783     2139      484
(5)*                0       75     9763    30455    23504     8666     2082      455

APPENDIX C
Table 7

POPULATION:   Erlang
SAMPLE SIZE:  20

Part I    Population Parameters

Mean             1.33    1.667      2.0    2.333    2.667
STD.DEV.        .6825    .7710    .8536    .9232    .9957
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparisons of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            299     2099    13599    33999    33999    13599     2099      299
γ(0,1) Index
(1)*                0        0    12775    43055    29172    11113     3142      943
(2)                 0        5    12569    42147    30198    11222     3055      809
(3)*                0       30    12625    41768    30649    11277     2897      754
(4)*                0       69    12582    41276    31091    11553     2798      631
(5)*                0      109    12587    40938    31459    11636     2705      566

APPENDIX D
Table 8

POPULATION:   Beta
SAMPLE SIZE:  5

Part I    Population Parameters

Mean               .5      .44      .40      .36      .33
STD.DEV.         .167    .1589      .50     .140    .1329
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)             74      524     3399     8499     8499     3399      524       74
Beta(0,1) Index
(1)*                0      411     3879     8062     8333     3896      419        0
(2)*                0      298     3894     8414     8146     3670      575        3
(3)*                0      194     3890     8739     7987     3515      660       15
(4)*                0      151     3942     8919     7849     3445      656       38
(5)*                0      100     3919     9111     7736     3382      696       56


APPENDIX D
Table 9

POPULATION:   Beta
SAMPLE SIZE:  10

Part I    Population Parameters

Mean               .5      .44      .40      .36      .33
STD.DEV.         .167     .157     .148    .1397     .132
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            149     1049     6799    16999    16999     6799     1049      149
Beta(0,1) Index
(1)*                0      802     7637    16467    16467     7726      935        0
(2)*                0      629     7841    16813    16123     7418     1164       10
(3)*                0      481     7796    14386    15766     7139     1387       45
(4)*                0      325     7879    17801    15558     6916     1457       64
(5)*                0      261     7789    18124    15378     6931     1396      121

APPENDIX D
Table 10

POPULATION:   Beta
SAMPLE SIZE:  15

Part I    Population Parameters

Mean               .5      .44       .4      .36      .33
STD.DEV.         .166    .1577    .1477    .1385    .1308
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparisons of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            224     1574    10199    25499    25499    10199     1574      224
Beta(0,1) Index
(1)*                0     1225    11574    24441    24621    11812     1327        0
(2)*                0      931    11741    25404    23774    11407     1727       16
(3)*                0      683    11820    26111    23470    10762     2105       49
(4)*                0      530    11927    26398    23346    10494     2189      116
(5)*                0      429    11841    26916    23037    10283     2275      176

APPENDIX D
Table 11

POPULATION:   Beta
SAMPLE SIZE:  20

Part I    Population Parameters

Mean               .5      .44      .40      .36      .33
STD.DEV.         .167     .156    .1469     .138    .1297
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            299     2099    13599    33999    33999    13599     2099      299
Beta(0,1) Index
(1)*                0     1641    15458    32592    32731    15798     1780        0
(2)*                0     1252    15714    33539    32161    14858     2447       29
(3)*                0      970    15740    34596    31365    14505     2721      103
(4)*                0      766    15774    35284    31087    13905     3005      179
(5)*                0      586    15896    35825    30499    13853     3094      247


APPENDIX E
Table 12

POPULATION:   Beta
SAMPLE SIZE:  5  (Fixed Mean, Different Variance)

Part I    Population Parameters

Mean               .5       .5       .5       .5       .5
Variance         .036     .022    .0195     .013    .0108
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)             74      524     3399     8499     8499     3399      524       74
Beta(0,1) Index
(1)*                0      166     4394     7941     7764     4572      165        0
(2)*                0      397     3908     8075     8336     3846      436        0
(3)*                3      430     3584     8573     8456     3659      493        2
(4)*                3      439     3573     8399     8492     3597      486       11
(5)*                9      431     3456     8496     8575     3581      439       11

APPENDIX E
Table 13

POPULATION:   Beta
SAMPLE SIZE:  10  (Fixed Mean, Different Variance)

Part I    Population Parameters

Mean               .5       .5       .5       .5       .5
Variance         .036     .022    .0195     .013    .0108
Index             (1)      (2)      (3)      (4)      (5)

Part II   Comparison of Distribution Frequencies

Intervals      < μ-3σ     μ-3σ     μ-2σ      μ-σ      μ+σ     μ+2σ     μ+3σ   > μ+3σ
N(0,1)            149     1049     6799    16999    16999     6799     1049      149
Beta(0,1) Index
(1)*                0      399     8788    15650    15758     8963      442        0
(2)*                0      838     7653    16251    16653     7811      894        0
(3)*                8      861     7361    16598    16669     7534      959       11
(4)*               15      946     7026    16749    16940     7318      985       21
(5)                12      919     6985    16834    16964     7226     1033       27

APPENDIX F
Table 14

POPULATION:        Erlang
POPULATION MEAN:   1.33

Degrees of                          VALUES OF 1-α
Freedom            .2       .5       .8       .9      .95      .98      .99     .999
4               .7416*   .9660*   .9960*   .9982*   .9996*     1.0*     1.0*     1.0
9               .9586*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
14              .9946*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
19              .9996*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0


APPENDIX F
Table 15

POPULATION:        Erlang
POPULATION MEAN:   1.667

Degrees of                          VALUES OF 1-α
Freedom            .2       .5       .8       .9      .95      .98      .99     .999
4               .7480*   .9708*   .9980*   .9998*   .9998*     1.0*     1.0*     1.0
9               .9650*    .998*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
14              .9952*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
19              .9998*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0

APPENDIX F
Table 16

POPULATION:        Erlang
POPULATION MEAN:   2.0

Degrees of                          VALUES OF 1-α
Freedom            .2       .5       .8       .9      .95      .98      .99     .999
4               .7466*   .9708*   .9966*   .9994*   .9998*     1.0*     1.0*     1.0
9               .9608     .999*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
14              .9958*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*
19                1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0

APPENDIX F
Table 17

POPULATION:        Erlang
POPULATION MEAN:   2.33

Degrees of                          VALUES OF 1-α
Freedom            .2       .5       .8       .9      .95      .98      .99     .999
4               .7394*   .9732*   .9972*   .9994*   .9998*     1.0*     1.0*     1.0
9               .9640*    .999*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
14              .9970*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
19                1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0

APPENDIX F
Table 18

POPULATION:        Erlang
POPULATION MEAN:   2.667

Degrees of                          VALUES OF 1-α
Freedom            .2       .5       .8       .9      .95      .98      .99     .999
4                .754*   .9750*   .9982*    .998*     1.0*     1.0*     1.0*     1.0
9               .9656*    .999*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
14              .9962*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0
19                1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0*     1.0

APPENDIX G
Table 19

POPULATION:        Beta
POPULATION MEAN:   .5

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4                .793*    .485*    .196     .099     .051     .020     .011     .002
9                .795*    .497     .199     .099     .051     .023     .014     .002
14               .807*    .498     .196     .095     .050     .040     .010     .001
19               .809*    .501     .199     .102     .051     .021      .01     .001

APPENDIX G
Table 20

POPULATION:        Beta
POPULATION MEAN:   .44

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4                .793*    .480*    .190*    .099     .050     .021     .011     .001
9                .805*    .053     .200     .102     .050     .021     .011     .001
14               .799     .489*    .193*    .098     .050     .041     .012     .001
19               .805*    .509*    .200     .109*    .058*    .023      .02     .001

APPENDIX G
Table 21

POPULATION:        Beta
POPULATION MEAN:   .40

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4                .793*    .488*    .192*    .096     .050     .020     .010     .001
9                .802     .491*    .192*    .096     .048     .021     .010     .001
14               .803*    .503     .196     .101     .053     .043     .011     .002
19               .791*    .494*    .189     .094     .046     .018      .01     .002


APPENDIX G
Table 22

POPULATION:        Beta
POPULATION MEAN:   .36

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4                .798     .492*    .204     .098     .053     .020     .010     .001
9                .793*    .494*    .194     .102     .054     .021     .012     .001
14               .795*    .507*    .204     .100     .051     .043     .011     .002
19               .803*    .501     .196     .094     .050     .022      .01     .001

APPENDIX G
Table 23

POPULATION:        Beta
POPULATION MEAN:   .33

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4                .801     .504     .205     .107*    .061*    .023*    .012*    .003*
9                .793*    .504     .200     .101     .052     .023     .011     .001
14               .796*    .507*    .209*    .106     .052     .046     .013     .002
19               .794*    .493*    .193*    .103  [illegible] .023      .02     .001

APPENDIX H
Table 24

POPULATION:            Beta (Fixed Mean, Different Variance)
POPULATION MEAN:       .5
POPULATION VARIANCE:   .036

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4               .7860*   .4796*   .2084*   .1160*   .0682*   .0314*   .0176*   .0012
9               .7952*   .4870*   .1948    .1054    .0562*   .0244    .0132    .0018

APPENDIX H
Table 25

POPULATION:            Beta (Fixed Mean, Different Variance)
POPULATION MEAN:       .5
POPULATION VARIANCE:   .022

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4               .7874*   .4842*   .2002    .1052    .0538    .0238    .0118    .0012
9               .8008    .4934    .1992    .0998    .0538    .0220    .0130    .0012

APPENDIX H
Table 26

POPULATION:            Beta (Fixed Mean, Different Variance)
POPULATION MEAN:       .5
POPULATION VARIANCE:   .0195

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4               .7908*   .4914*   .1948    .1068    .0556    .0224    .0102    .0008
9               .8012    .5042    .1956    .0976    .0486    .0220    .0094    .0008

APPENDIX H
Table 27

POPULATION:            Beta
POPULATION MEAN:       .5
POPULATION VARIANCE:   .013

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4               .7864*   .4824*   .1868*   .0924*   .0484    .0188    .0084    .0002
9               .7910*   .4908*   .1940    .0970    .0492    .0196    .0112    .0018

APPENDIX H
Table 28

POPULATION:            Beta
POPULATION MEAN:       .5
POPULATION VARIANCE:   .0108

Degrees of                    ACTUAL SIGNIFICANCE LEVELS
Freedom            .8       .5       .2       .1      .05      .02      .01     .001
4               .8026*   .4906*   .2004       .1    .0498    .0204    .0104    .0014
9               .7910    .5030    .2010    .0974    .0464    .0172    .0090    .0008

APPENDIX J
Instructions for the Use of Computer Program

The attached computer program was used to obtain the
simulation results for the Beta distribution. The program,
written in Fortran, runs on an IBM 360 computer. It consists of
four simulations, requires 47 minutes of execution time, and
uses 505 K of storage. The program, as presented in Appendix J,
is self-sufficient in that it does not require any external
subroutines or programs for its execution. The program contains
its own random number generator with variable-seed
characteristics. It enables the user to change the various
parameters without any reorganization of the program.

To assist users in understanding and utilizing various
facets of the program, a number of comment cards are used
throughout the program. The following instructions illustrate
how to vary the parameters of the Erlang distribution, the
number of cycles in the simulation, or the number of samples
for the distributions. If one consults Appendix J, page 62, one
should find that the parameter B1 = 3 corresponds to λ = 3, and
the values of K1 and K2 represent the values of parameter K in
the Erlang distribution. In order to obtain five different
Erlang distributions in each cycle of each simulation (as
illustrated in Table 2), one would set K1 = 4, K2 = 3 and make
use of the Do loop 777. Thus, in order to change the number of
cycles to 3, one would simply replace Do 777 IB=1,5 with
Do 777 IB=1,3.
This program is designed to furnish 4 different sample
sizes. This is accomplished by the first Do loop on page 62,
i.e., Do 778 IN=1,4. The sample size is determined by the
value of the increment in the statement MR = MR + 5. If four
sample sizes of 2, 4, 6, and 8 are required, one would
simply change the statement to MR = MR + 2. If only two sample
sizes of 5000 samples each are desired, one would replace
Do 778 IN = 1,4 by Do 778 IN = 1,2. The number of samples in
each simulation is determined by the Do loop "Do 1000 II = 1,50".
As is apparent from the dimensions of GAMMA1 and GAMMA2, the
program, for given values of K1 and K2, generates 100 samples of
size MR which are Beta distributed. It calculates the new t
statistic, compares the absolute values of this statistic
with the critical values, and stores the beta variates in a
location called "STOR". Utilizing a different seed, the entire
process is repeated 50 times, thus yielding a total of 5000
samples of size MR. After the generation of 5000 samples, which
are located in "STOR", the distribution frequency comparisons
are performed, and the desired results are provided as
indicated in Appendix J. The values of the parameters are then
incremented by the Do loop 777, and the entire process is
repeated until 5 cycles are completed for that value of the
sample size. At the end of Simulation I, the sample size is
incremented, and the entire process is repeated.
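
The control structure described above can be sketched in Python as follows. This is not a transcription of the Fortran program: the function names are hypothetical, the Erlang variates are built as sums of exponential variates, and the Beta variates are formed as the ratio G1 / (G1 + G2) of two independent Erlang (gamma) variates. The last point is an assumption suggested by the array names GAMMA1 and GAMMA2; the text does not state the method explicitly.

```python
import random


def erlang(k, lam, rng):
    """One Erlang(k, lam) variate as the sum of k exponential variates."""
    return sum(rng.expovariate(lam) for _ in range(k))


def beta_from_gammas(k1, k2, lam, rng):
    """Beta(k1, k2) variate as G1 / (G1 + G2) (assumed construction)."""
    g1 = erlang(k1, lam, rng)
    g2 = erlang(k2, lam, rng)
    return g1 / (g1 + g2)


def sample_sizes(n_sizes=4, step=5):
    """Mirrors 'Do 778 IN=1,4' with the increment 'MR = MR + 5'."""
    mr = 0
    sizes = []
    for _ in range(n_sizes):
        mr += step
        sizes.append(mr)
    return sizes


def samples_per_cycle(blocks=50, samples_per_block=100):
    """Mirrors 'Do 1000 II = 1,50' applied to blocks of 100 samples."""
    return blocks * samples_per_block


rng = random.Random(12345)
# K1 = K2 = 4 is assumed here so that the population mean is .5.
variates = [beta_from_gammas(4, 4, 3.0, rng) for _ in range(2000)]
mean_beta = sum(variates) / len(variates)
print(sample_sizes(), samples_per_cycle(), round(mean_beta, 3))
```

Changing `step` to 2 reproduces the sample sizes 2, 4, 6, and 8 mentioned in the text, and reducing `n_sizes` to 2 corresponds to replacing Do 778 IN = 1,4 by Do 778 IN = 1,2.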

A sample of the results of one simulation is furnished
in Appendix J. It may be recalled that each simulation
consists of five cycles; for each cycle, the program furnishes
the following information, as shown in Appendix J:

1.  Transformed Beta frequency distribution, denoted by (1).

2.  N(0,1) frequency distribution, denoted by (2).

3.  Sample size, denoted by (3).

4.  Distribution mean, denoted by (4).

5.  The actual levels of significance, denoted by (5).

6.  The observed levels of significance, denoted by (6).

7.  The $\chi^2_{.95}(1)$ statistic for each level of
significance, denoted by (7).

At the end of the simulation, a summary of the results is
furnished which contains the actual alpha values, the observed
alpha values for each of the five cycles, the $\chi^2_{.95}$
values for one degree of freedom, and the observed $\chi^2(1)$
statistics for each significance level. The computer program
for the Gamma distribution also utilizes the same techniques.
This program is furnished in Appendix J.
COMPUTER PROGRAM FOR ERLANG DISTRIBUTION